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a-pt P-based ir.<*W for integ er, r physical and genetic maps . 

•Field of the invention 
The present invention pertains to the field of biotechnology, more in particular to 
the field of genomic mapping and more in particular to the field of linking physical and 
genetic markers and to the field of integrating physical and genetic maps. The invention 
5 perfctins to a method for the integration of physical and genetic maps, more in particular to 
the high throughput building of integrated genetic and physical maps. 

Background of the invention 
Integrated genetic and physical genome maps are extremely valuable for map-based 

10 gene isolation, comparative genome analysis and as sources of sequence-ready clones for 
genome sequencing projects. The effect of the availability of an integrated map of physical 
and genetic markers of a species for genome research is enormous. Integrated maps allow 
for precise and rapid gene mapping and mapping of all microsatellite loci and other 
application such as SNA marker based gene manipulation. Various methods have been 

15 developed for assembling physical maps of genomes of varying complexity. One of the 

better characterised approaches use restriction enzymes to generate large numbers of DNA 
fragments from genomic subclones (Brenner et al, Proc. Natl. Acad. Sci.. (1989), 86, 
8902-8906; Gregory et al, GenomeRes. (1997), 7, 1162-116S; Marram al, Genome Res. 
(1997), 7, 1 072-1084). These fingerprints are compared to identify related clones and to 

20 assemble overlapping clones in contigs. The utility of fingerprinting for ordering a complex 
genome is limited, hoAvever, due to variation in DNA migration from gel to gel, the 
presence of repetitive DNA sequences, unusual distribution of restriction sites and skewed 
clone representation. Moreover, fingerprinting alone, unless combined with other methods, 
does not link genomic clones directly to genetic maps. Therefor most high quality physical 

25 maps of complex genomes have been constructed using a combination of fingerprinting 
and PCR-based or hybridisation based methods. 

Selective restriction fragment amplification or AFLP is known, for instance from 
the European patent application 0 534 858 and US 6,045,994 by applicant and from an 
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article by Vos et at. Nucleic Acids Research (1995), 23, 4407-4414, incorporated herein by 
reference. In general, AFLP comprises the steps or 

(a) digesting a nucleic acid, in particular a DNA or a cDNA, with one or more specific 
restriction endonucleases, to fragment said DNA into a corresponding series of 

5 restriction fragments; 

(b) ligating the reslriction fragments thus obtained with at least one double-stranded 
synthetic oligonucleotide adapter, one end of which is compatible with one orboth 
of the ends of the restriction fragments, to thereby produce tagged restriction 
fragments of the starting DNA; 

10 (c) contacting said tagged restriction fragments under hybridising conditions with at 

least one oligonucleotide primer; 

(d) amplifying said tagged restriction fragments hybridised with said primers by PCR 
or a similar technique so as to cause further elongation of the hybridised primers 
along me restriction fragments of the starting DNA to which said primers 

15 hybridised; and 

(e) identifying or recovering the amplified or elongated DNA fragment thus obtained. 
The amplified DNA-fragments thus obtained can then be analysed and/or 

visualised, for instance by means of gel-electrophoresis. This provides a genetic fingerprint 
showing specific bands corresponding to the restriction fragments which have been linked 

20 to the adapter, have been recognised by the primer, and thus have been amplified during the 
amplification step. The fingerprint thus obtained provides information on the specific 
restriction site pattern of the starting DNA, and thus on the genetic make-up of the 
organism from which said DTSTA has been derived. 

AFLP can therefore be used to identify said DNA; to analyse it for the presence of 

25 specific restriction site patterns, restriction fragment length polymorphisms (RFLPs) and/or 
specific genetic markers (so-called "AFLP-markers"), which may be indicative of the 
presence of certain genes or genetic traits; or for similar purposes, for instance by 
comparing the results obtained to DNA-samples of known origin or restriction pattern, or 
data thereon. AFLP is eminently suited to characterise genetic markers by means of one or 

30 more of the AFLP fragments thus visualised. 
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The primers used in AFLP are such that they recognise the adapter and can serve as 
a starting point fox the polymerase chain reaction. To this end, the primers must have a 
nucleotide sequence that can hybridise with (at least part of) the nucleotide sequence of the 
adapter adjacent to the 3' end of the restriction fragment to be amplified. The primers can 

5 also contain one or more further bases (called "selective bases") at the 3'-end of then- 
sequence, for hybridisation with any complementary base or bases at the 3 T -end of the 
adapter ligated restriction fragment, Located between the part of the primer that hybridises 
to the adapter and the selective bases that hybridise to the restriction fragment, the primer 
may contain a section that is capable of hybridising to the remains of the restriction site. 

10 Thus, in general an AFLP primer has the following structure: adapter complementary part- 
restriction site remains complementary part-selective bases. The adapter complementary 
part- restriction site remains complementary part is generally depicted as the 'constant 
sequence 7 of the AFLP primer and the selective bases as the Variable sequence'. 

As, of all the adapter-ligated restriction fragments present in the mixture, only those 

15 fragments that contain bases complementary to the selective bases will subsequently be 

amplified, the use of these "selective'' primers will reduce the total amount of bands in the 
final fingerprint, thus making the fingerprint more clear and more specific. Also, the use of 
different selective primers (i.e. different variable sequence) will generally provide different 
fingerprints, which can also be used as a tool for the purposes of identification or analysis. 

20 The selective nucleotides are complementary to the nucleotides in the adapter- 

hgated restriction fragments that are located adjacent to the constant primer sequence. 
Primers containing selective nucleotide are denoted as +N primers, in which N 
stands for the number of selective nucleotides present at the 3 '-end of the primer. N is 
preferably selected from amongst A, C, T or G. 

25 N may also be selected from amongst various nucleotide alternatives, i,e, 

compounds that are capable of mimicking the behaviour of ACTG-nucleotides but in 
addition thereto have other characteristics such as the capability of improved hybridisation 
compared to the ACTG-nucleotides or the capability to modify the stability of the duplex 
resulting from the hybridisation. Examples thereof are peptide nucleic acids (PNAs) ? 

30 locked nucleic acids (LNAs), inosine etc. When the amplification is performed with more 
than one primer, such as with PCR using two primers, one or both primers can be equipped 
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with selective nucleotides. The number of selective nucleotides may vary, depending on the 
species or on other particulars determinable by the skilled man. In general the number of 
selective nucleotides is not more than 10. but at least 5, preferably 4, more preferably 3, 
most preferred 2 and especially preferred is 1 selective nucleotide. 
5 A +1 primer thus contains one selective nucleotide, a +2 primer contains 2 selective 

nucleotides etc. A primer withno selective nucleotides (i.e. a conventional primer) can be 
depicted as a +0 primer (no selective nucleotides added). When a specific selective 
nucleotide is added, this is depicted by the notion +A or +C etc. 

By amplifying a set adapter ligated restriction fragments with a selective primer, a 
10 subset of adapter-ligated restriction fragments is obtained, provided that the 

complementary base is present at the appropriate position in the restriction fragment, Using 
a +1 primer, for example, the complexity (and the number of visualised fragments) of the 
amplified mixture is reduced by a factor 4 compared to a amplification with a non-selective 
primer (a +0 primer) Higher reductions can be achieved by using primers with multiple 
15 selective nucleotides, i.e. 16 fold reduction of the original multiplex ration is obtained with 
2 selective nucleotides etc. 

As AFLP provides amplification of both strands of a double stranded starting 
DNA AFLP advantageously allows for exponential amplification of the fragment, i.e. 
according to the series 2, 4, 8, l6, etc.. Also, AFLP requires no prior knowledge of the 
20 DNA sequence to be analysed, nor prior identification of suitable probes and/or the 
construction of a gene library from the starting DNA. 

For a further description of AFLP, its advantages, its embodiments, as well as the 
techniques, enzymes, adapters, primers and further compounds and tools used therein, 
reference is made to EP-0 534 858, and to Vos et al Nucleic Acids Research (1995), 23, 
25 4407-4414 both publications are incorporated herein by reference. Also, in the description 
heieinbelow, the definitions given in paragraph 5.1 of EP-0 534 858 will be used, unless 

indicated otherwise. 

The potential of AFLP as a technology for the integration of physical and genetic 
maps has been recognised before. Klein et al. in Genome Research, (2000), 10, 798-807 
30 have described the use of AFLP in the integration of physical and genetic maps of 

Sorghum. The method of Klein &t al. comprises generating AFLP fingerprints using +3/+3 
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selective primers of aU individual BAC clones in a library. The method further comprises 
the generation of pools of clones. The pools are also analysed by AFLP fingerprinting 
albeit using different restriction enzymes and otherwise other circumstances compared to 
the generation of the AFLP fingerprints of all the individual BACs. The use of different 
5 enzyme combinations renders the method difficult. The method of Klein et al. although 
feasible, is complex and laborious and comes with several other disadvantages. One of 
them is that the method is unsuitable for positioning non-polymorphic markers on the 
integrated map. 

It is a goal of the present invention to provide for an improved method for the 
10 integration of physical and genetic maps. It is a further goal of the present invention to 

provide for an improved method based oh AFLP. It is yet a further goal of the invention to 
provide for a high throughput method for the integration of physical and genetic maps that 
results in maps of improved quality such as measured by increased marker densities or as a 
result of more reliable contig generation, 

15 

Description of the invention 
The present invention pertains to methods for the integration of physical and 
genetic markers and for the integration of physical and genetic maps. The present invention 
overcomes many of the previously encountered problems in the art and provides for 

20 significant improvements and advantages that will become clear through this description, , 
the claims and the examples. The method is based on AFLP and more in particular based 
on the use of combinations of selective primers in AFLP. The method results in the 
integration of genetic and physical maps and the linking of physical and genetic markers. 
The invention further pertains to the use of AFLP primers in the methods of the invention 

25 and to a set of structurally defined AFLP primers. 

Detailed description of t he invention 
la one aspect the invention pertains to a method for the integration of physical and 
genetic maps by associating a restriction fragment, preferably an AFLP fragment with a 
30 genetic marker, providing a library of clones containing inserts of the (part of) genome of 
interest, pooling the clones into pools and fingerprinting the pools using AFLP primer-pairs 
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of higher selectivity, individually fingerprinting the clox.es in a library using AFLP primer- 
pairs of lower selectivity. The pools that contain a fragment corresponding to the fragment 
associated with the genetic marker are identified and the individual clones contammgthe 
fragment corresponding to the fragment associated with the genetic marker are identrired 
5 and linked. From the fingerprint patterns of the individual clones containing the fragment 
associated with the genetic marker a contig is generated. The contig is linked to the geneUc 
marker on the genetic map and the process is repeated for all genetic markers, thereby 
providing an integrated physical and genetic map. 

Thenotion 'primer-pairs of higher/lower selectivity as used herein relates to the 
10 number of selective nucleotides in the AFLP primers used for fingerprinting. Thus a +0/+0 
primer-pair is of a lower selectivity than a +0/+1 primer-pair or a +1/+0 primer-pam 
Similarly a +3/+4 primer-pair is of a higher selectivity than a +2/+2 primer-pair etc. 

In one aspect the invention pertains to a method for linking a genetic and a physical 
genome map comprising the steps of: 
15 (a) providing individual genetic markers, preferably in the form of a genetic 

map; 

(b) characterising each of the genetic markers by means of at least one AFLP 
fragment identified through AFLP fingerprinting; 

(c) providing an artificial chromosome library such as a BAC or YAC; 
20 (d) generating a multitude of pools, each pool containing a multitude of 

artificial chromosomes from the library; 

(e) generating an AFLP fingerprint for each of the pools; 

(f) selecting from the multitude of pools the individual pools in which the at 
least one AFLP fragment associated with a genetic marker is present in the 

25 fingerprint of the individual pool; 

(S) generating an AFLP fingerprint of the individual artificial chromosomes 
from fire pools identified under (f); 

(h) selecting from the fingerprints of the individual artificial chromosomes of 
step (g) the individual artificial chromosome(s) in which the AFLP fragment 

30 associated with the genetic marker is present in the fingerprint; 

(i) generating a contig of the individual artificial chromosomes identified in 
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step (h); 

0) linking the contig obtained in step & to the genetic maiker on the genetic 
map, thereby obtaining a linkage between the physical marker and the 
genetic marker; 

5 (k) repeating steps (g-j) for all genetic markers and align Hie contigs obtained to 

thereby obtain an integrated physical and genetic map; 

wherein the forward and reverse AFLP primers used in step (b) and (f) comprise K 

respectively L selective nucleotides at the 3'-end of the primer, wherein the forward and 

reverse AFLP primers used in step (d) comprise M respectively N selective nucleotides at 
10 the 3'-end of the primer, wherein K, L, M,N are integers from 0 to 10, and wherein K+L-fe 

M+N. 

The method of the invention is flexible. Flexibility in this respect refers to the 
starting material. The method of the invention may start from whole genomes of any 
species, but is also applicable to only apart of the genome or a selected region of the 
1 5 genome or a chromosome or part thereof. 

The method starts by using individual genetic markers. It is preferred that these 
markers have been integrated on a genetic map, but this is not a pre-requisite. The method 
of the inventions is equally well suited to link physical markers to genetic markers, after 
which the physical markers are aligned to from a physical map and at the same time 
20 provide a genetic map. The genetic map may comprise only one single genetic marker but 

preferably the map comprises two or more genetic markers. Each of the genetic markers is 
characterised by an AFLP fragment or by a combination of AFLP fragments. The second 
step is the provision of a library of artificial chromosomes. The library can be a Bacterial. 
Artificial Chromosome library (BAC) or based on Yeast Artificial Chromosome (YAC). 

25 Other libraries such as based on Cosmids, PAC, TAC or MAC are also possible. Preferred 
is a BAC library. The library is preferably of a high quality and preferably is a high insert 
size genomic library. This means that the individual BAC contains a large insert of the 
genomic DNA under investigation. The size of the preferred large insert is species- 
dependent. Throughout this application reference is made to BACs as examples of artificial 

30 chromosomes. However, it is noted that the present invention is not limited thereto and that 
other artificial chromosomes can be used without departing from the gist of the invention. 
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Preferably the libraries contain at least five genome equivalents, more preferably at least 7, 
most preferably at least 8, Particularly preferred is at least 10. The higher the number of 
genome equivalents in the library the more reliable is the resulting contig and the integrated 
maps. 

5 The individual clones in the library are pooled to form pools containing a multitude 

' of artificial chromosomes or clones. The pooling maybe the simple combination of a 
number of individual clones into one sample (for example, 100 clones into 10 pools of 10 
clones), but also more elaborate pooling strategies maybe used. The distribution of the 
clones over the pools is preferably such that each clone is present in at least two or more of 

10 the pools. The pools are generated based on pooling strategies well known in the art The 
skilled man is capable selecting the optimal pooling strategy based on factors such as 
genome size etc. The resulting pooling strategy will depend on the circumstances, and 
examples thereof are plate pooling ^dimensional pooling such as 3D-pooling, 6D-pooling 
or complex pooling. 

15 Each of the pools is fingerprinted using AFLP and preferably with primer-pairs 

having a high selectivity to thereby generate fingerprints that contain a limited number of 
bands or fragments. Preferably the primer-pairs used in the fingerprinting of the pools axe 
the same primer-pairs that are used to identify the AFLP fragment characteristic of the 
genetic markers in step (b). The primer-pairs with a high selectivity contain more selective 

20 nucleotides than the primer with a low selectivity and are for instance +3/+3 primer-pairs 

or +2/+3 primer-pairs. 

The fingerprints of the pools are subsequently screened for the presence or absence 
of the fragments of interest that are associated with the genetic marker(s). From apool (or 
pools) in which the desired fragment has been found present, each of the individual clones 

25 in that pool (or pools) is fingerprinted using AFLP. The individual clones are preferably 
fingerprinted with primer-pairs having a low selectivity to thereby generate fingerprints 
that contain a large number of bands or fragments that later aid in the generation of reliable 
contigs. In general in this method, the AFLP fingerprints of the genetic markers, the pools 
and the individual clones are preferably obtained using the same combinations of 

30 enzyme(s) and adapters. Preferably only the primers are different and preferably only the 
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seleotive bases are different in number and/or in type. The primer-pairs with a low 
selectivity are for instance +0/+0 primer-pairs or +1/+0 primer-pairs. 

The fingerprints of the individual clones that have been obtained are now subjected 
to a selection step whereby those fragments are selected that are most likely connected to 
5 the genetic marker of interest This is achieved by subjecting the individual clones 
containing the desired fragment to a election step, preferably in silico. 

In this selection step, the fmgeiprinting patterns of the individual clones are 
compared. When fingerprinting patterns are compared that axe derived from clones with 
overlapping inserts, the respective fingerprints will contain one or more bands/fragments 
10 that are similar in place, intensity, type etc. (See for example Fig 1., the bands are marked 
as #, A $). With fingerprint patterns that are derived from clones that do not have 
overlapping inserts the chance that one or more bands express such a similarity is 
significantly lower. When, based on the fingerprint patterns, it is likely that the associated 
inserts of the clones are overlapping inserts, alignment in silico results in the generation of 
1 5 a contig. This contig is then linked to the genetic marker and the genetic marker can be 

positioned on the genetic map. Repeating these steps for various genetic markers ultimately 
results in positioning of all contigs on the genetic map and in the integration of the physical 
and genetic map. 

The advantage of the fingerprinting of the clones using primers with low selectivity 
20 is that many fragments are amplified. Consequently, a large number of physical markers are 
identified. Typically 4-5 times more scorable fragments are generated with a +0/+) 
amplification compared to +2/+3 amplification and leads consequently to a 4 or 5 times 
higher resolution of the contigged clones. In mis way, a high-resolution physical map is 
generated. This is advantageously in me generation of contigs and it will reduce the number 
25 of singletons that may occur in the generation of contigs. 

The term 'contig' as used herein refers to a continuous sequence of DNA that has 
been assembled from overlapping cloned DNA fragments, following the definition 
provided in The Encyclopedia of Molecular Biology (1994, Blackwell). 

In general, the primers of low and high selectivity are characterised in that the 
30 forward and reverse AFLP primers of high selectivity used in the fingerprinting of the 
individual clones and in the identification of the AFLP fragment associated with the 
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genetic marker comprise K respectively L selective nucleotides at the 3 '-end of the primer, 
wherein the forward and reverse AFLP primers used in the fingerprinting of the individual 
clones from the library comprise M respectively N selective nucleotides at the 3 '-end of the 
primer, wherein K, L, M, N are integers from 0 to 10, and wherein K+L SM+N. Thus the 
5 primers of high selectivity contain at least one selective nucleotide more than me primers 
of low selectivity. 

Alignment is performed by comparing band patterns and intensities of the 
fingerprint to identify overlapping patterns. Clones containing overlapping patterns overlap 
can be aligned based thereon. Due to the use of AFLP with low selective primers, the 

10 physical markers present in the clones are abundant and hence the alignment thereof can be 
performed with great accuracy. The contig can be generated using any means known in the 
art. The clones can be aligned using software well known in the art for these purposes such 
as FPC (Soderlund C, L Longden, R. Mott, 1997, FPC: a system for building contigs from 
restriction fingerprinted clones. Comput. Applic Biosci. 13:523-535). This software can be 

15 used in the method ofthe present invention with cut-off parameters that are between 10" 5 - 
10' 15 , preferably between 10" 6 - 10" 14 , more preferably between 10' 7 - 10" 13 , in particular 
between 10' 8 - 10" 12 , more preferably between 10' 9 - 10"" , and most preferred 10" 10 . Cut- 
off is a parameter that that represents the threshold value representing the maximum 
allowable probability of a chance match between any two clones. Tolerance is a parameter 

20 that is a measure for the maximum distance or bandwidth which two bands from two 
different clones can differ and still be considered the same band. In this invention, 
tolerance is rated from 0 (identical) to 5 (difference of 0.5 base pair). Preferably, tolerance 
is as low as possible (i.e. 0), but in practice a value of 3-4 (difference of 0.3-0.4 base pair) 
is acceptable and 1-2 is preferred. 

25 The generated contigs are then linked to the genetic marker on the genetic map and 

the whole process is repeated for the other genetic markers ofthe genetic map thereby 
irrtegrating the genetic map with the physical map. 

The method according to the invention has certain advantages over methods 
disclosed hi the art. One of the advantages is that the resulting integrated map has high 

30 resolution due to the fingerprinting ofthe individual clones with primer-pairs of low 

selectivity. The amplification step of AFLP with low selective primers will result in the 
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amplification of more adapter-ligated restriction fragments and consequently to more 
physical markers per clone as compared to similar fingerprinting with primers of high 
selectivity. Due to this increased number of physical markers per clone the subsequent 
generation of a contig is also of a better quality and more clones can be correctly aligned 

5 resulting in an improved integration of the physical and genetic map. 

Another advantage is in the speed of the generation of the contig. With the method 
of the present invention relative small sets of ^dividual clones are selected that are 
associated with the AFLP fragment of the genetic marker of which a contig is generated. 
Small sets have the advantage that the generation of a contig is significantly faster than 

10 when larger sets are used. After placement of me contigs in the genetic map, a more 
reliable integrated map is obtained by ahgning the placed contigs and verifying the 
alignments with the position of the genetic marker on the genetic map. One of the further 
advantages is that the relative genetic distance (generally in centimorgan) are now 
converted to physical distances (in basepaixs) and may provide a correction or enhancement 

1 5 of the genetic map, for instance by reversing the order of genetic markers or by more 

accurately determining the distance between markers. One of the most distinct advantages 
is that each step is verifiable and correctable. This means that in case an amplification or a 
detection, for instance on a gel, is not of the desired quality, this step can be immediately 
corrected until data of the desired quality are obtained. 

20 A significant advantage of the method of the invention over the methods known in 

the art is that the present method provides for the detection of non-polymorphic markers 
and their subsequent integration in the integrated map. Another advantage associated with 
the method of the invention is that the generation of physical mapping data and linkage to 
the genetic map are achieved directly from the start of the method (for each marker 

25 separately). 

In one embodiment the method can also be used for the linking or integrating of a 
physical and genetic marker. The method then comprises the steps of: 

(a) characterising the genetic marker by means of at least one AFLP fragment 
identified through AFLP fingerprinting; 
30 (b) providing an artificial chromosome library such as a BAC or YAC; 
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generating a multitude of pools, each pool containing a multitude of artificial 
chromosomes from the library; 
generating an AFLP fingerprint for each of the pools; 
selecting from the multitude of pools those individual pools in which the 
AFLP fragment associated with the genetic marker is present in the 
fingerprint of the individual pool; 

generating an AFLP fingerprint of the individual artificial chromosomes from 
the pools identified under (f); 

selecting from the fingerprints of the individual artificial chromosomes of 
step (g) the individual artificial chromosomes in which the AFLP fragment 
associated with the genetic marker is present in the fingerprint; 
generating a contig of the individual artificial chromosomes identified in step 

CO; 

linking the contig obtained in step (i) to the genetic marker on the genetic 
map thereby obtaining a linkage between the physical marker and the genetic 
marker; 

wherein the forward and reverse AFLP primers used in step (b) and (f) comprise K 
respectively L selective nucleotides at the 3 '-end of the primer, wherein the forward and 
reverse AFLP primers used in step d comprise M respectively N selective nucleotides at the 
20 3'-end of ihe primer, wherein K, L, M, N are integers from 0 to 10, and wherein K+L ^ 
M-HSL 

In this embodiment the pres ent invention is used for the single integration of a 
genetic marker with a (group of) physical marker(s). By repeating these steps for all 
available genetic markers, the complete integrated map can be constructed, but in this 

25 embodiment this is entirely optionaL 

In one embodiment, the difference in the number of nucleotides that are used in the 
set of primers with low selectivity compared to the set of primers of high selectivity, also 
depicted as (K+L) - (M+N), is at least 1, preferably at least 2, more preferably at least 3, 
most preferably at least 4. In one embodiment, the number of selective nucleotides Used in 

30 the set of primers of low selectivity, also depicted as is M+N s is at least 0 3 preferably at 
least 1 , more preferably at least 2, most preferably at least 3 . 
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The pools of the individual clones that are used in the method of the present 
invention preferably contain between 0.001 and 1 genome equivalent of the total genome to 
be analysed, preferably between 0J and 0.75, more preferably between 0.15 and 0.60, most 
preferably between 0.20 and 0.50. More preferred is between 0.25 and 0.3 S 7 more 
5 preferably 0,4, most preferably 0.3. 

In general, a pool of individual clones is Limited in the numbe* of clones it can 
contain. In case of exceptionally large genomes, libraries or very small inserts, it may be 
advantageously to introduce and additional pooling step. Therefore in a preferred 
embodiment of the invention, the method comprises an. additional pooling step. The 
10 additional pooliag step results m pools that can be fingerprinted using primers with the 
same (Mgh) selectivity as are used for the fingerprinting of the pools of the initial pooling 
step. It is also possible to use primers of a intermediate selectivity, that is, primers with a 
number of selective nucleotides (P+Q) between K-HL and M+N such tibat K+L Z> P+Q ^ 
M+N, 

15 En an alternative embodiment, it is possible to avoid the use of pools of clones 

altogether by generating a contig of all individual clones that have been fingerprinted using 
primers with low selectivity, Optionally a pre-selection of the individual clones is made 
wherein the clones at least contain the fragment that is associated with, the genetic marker. 
More in detail: the individual fingerprints obtained using primers of low selectivity are 

20 screened for the presence of the (fragment) marker associated with the genetic marker that 
has been identified using the primers with high selectivity. This embodiment is 
advantageously for instance when smaller genomes are investigated or when only a few 
genetic markers are integrated with the physical markers. In such cases there is no real need 
for a pooling step. 

25 In one embodiment to characterise all genetic markers available by AFLP it may be 

preferred to use different primers and/or primer combinations for different genetic markers. 
The primers and/or primer combinations for the fingerprinting of the pools and the 
individual clones can thm to be adapted accordingly. It is even possible to use different 
enzyme combinations for different genetic markers. The associated contigs of BACs can 

30 be aJigned based on the overlap of the BAC contigs and the occurrence of identical 
physical markers. This is schematically depicted in Fig 3 . 
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The method of the invention is in principle suitable for the linkage of one or more 
physical markers to a genetic marker and for the linkage of physical markers to the genetic 
map comprising a multitude of genetic markers, It is preferred that the genetic markers are 
spread evenly across the map. It is farther preferred that the genetic map is a map that is 
5 regarded in the art as a genetic map with a high marker density. This facilitates the linkage 
of (the contigs of) the physical markers to the genetic map and enhances the quality and 
reliability of the resulting integrated physical and genetic map. Preferably the genetic map 
has an (average) density of at least 1 genetic marker per 1000 kb, preferably per 500 kb 
more preferably per 200 kb, most preferably per 100 kb. Particularly preferred is a genetic, 

10 map with a density of at least 1 genetic marker per 50 kb. 

When all genetic markers are linked to a contig derived from the individual clones 
in the library, gaps may remain and/or clones may remain that have not been placed on the 
map. Due to the increased density of the physical markers obtained by the fingerprinting of 
the individual clones with primers of low selectivity, the placement of these clones can "be 

15 accomplished using conventional alignment software, and using techniques such as BAC 
walking and using the BAC end sequences. 

In another aspect, the present invention pertains to the use of AFLP primers for the 
integration of genetic and physical maps and to the use of AFLP and AFLP primers in 
linking genetic and physical genome maps. AFLP and AFLP primers have demonstrated to 

20 provide an excellent technique for the integration of the physical and genetic maps. 

In a further aspect the invention pertains to the use of a first and a second pair of 
AFLP primers in a method for linking genetic and physical genome maps wherein the first 
pair of AFLP primers comprise K respectively L selective nucleotides at the 3 '-end of the 
primer, wherein the second pair of AFLP primers used comprise M respectively N selective 

25 nucleotides at the 3 '-end of the primer, wherein K, L, M, N are integers from 0 to 1 0, and 
wherein K+L ^M+N. 

By linking the physical contigs to the genetic markers a first integrated map can be 
obtained. Based thereon further improvement, correction and refinement of the genetic map 
is possible by generating a contig of the various physical markers linked to a genetic 

30 marker. 
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The invention provides for the construction of high resolution integrated genetic 
and physical map of any genome without the need for prior knowledge of the sequence. 
The invention further facilitates the efficient construction of detailed physical maps of, 
optionally pre-selected, regions of any genome. The invention further facilitates the 
5 discovery of novel BAG derived,. (AFLP) markers, both polymorphic and non-polymorphic 
for the integrated genetic map. 

Description of the Figures 
Figure 1 discloses a schematic representation of the integration of a physical and a genetic 

10 map, The method starts by identifying genetic markers using AFLP fmgexprinting- In this 
case a genetic marker was identified using HindUL and Ms el as restriction endonucleases in 
the AFLP fingerprinting and Hind+2 and Mse+3 high selective primers carrying AT and 
CCA as selective nucleotides at their 3' end, respectively. The identified AFLP fragment 
associated with the genetic marker has a length of 344 bp. A B AC library is prepared 

15 comprising Z clones. The individual clones are pooled (pool I contains clones 1, 2 and 3; IE 
contains 3 7 5 and Z; and IH contains 1 A and 5) and the pools are fingerprinted using AFLP 
with identical enzymes and primers with the same selectivity as used in the identification 
of the genetic marker (Hind+2(AT) and Mse-K3(CCA)). This positively identifies those 
pools that contain the fragment AT/CCA-344 (pools I and JS). From each clone present in 

20 the pool (pools I and TO) that contains the fragment of interest, (1, 2, 3, 4, 5) an AFLP 

fingerprint is generated, using primers of lower selectivity, in this case +0/+0. Each of the 
individual clones is fingerprinted and bands with 344 bp length can be identified ( in 2, 3 5 
4, 5). These bands may very well be unrelated to the genetic marker of interest, as they also 
include any fragment that contains other nucleotides than the AT/CCA combination at the 

25 respective ends. From the individual fingerprints containing the 344 fragment the clones 

are selected in silico that are most likely to be linked to the AFLP fragment of interest. This 
selection step is based on the comparison of one or more bands in the fingerprint patterns. 
The more corresponding bands are found in the fingerprints (the bands are depicted as #, A 
$), the higher the likelihood that these fingerprints are able to form a contig and thus align 

30 the individual clones from the library to form a contig, The individual clones that contain a 
fragment with a length of 344 bp and that are selected based on the in silico process of the 
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fingeiprint patterns are aligned using FPC to form a contig. This contig can be placed on 
the genetic map- False positive fragments (i.e. tibtat contain the fragment AT/CCA-344, but 
are not linked to the genetic marker of interest) will not form a contig under the conditions 
used. Repeating these steps for other genetic markers results in the formation of an 
5 integrated physical and genetic map. 

Figure 2: A: HindJJl+2/MseI+3 fingerprints of BAC pools of Pristionchus pacificus with 
the previously identified AFLP fragments characteristic of the genetic markers indicated by 
arrows. 

B: Htntini+O/Msel+Q fingerprints of individual BACs of corresponding positive pools in 
10 Fig 2A.and indication of the characteristic AFLP bands (BAC1 and BAC2). 

C: Corresponding Hindm+O/MseI+0 BAC contig and identification of the BAC1 and 
BAC2 clones. 

Figure 3: Schematic representation of the method for the integration of physical and 
genetic maps, whereby for different genetic markers the linkage between the physical BAC 

15 clones and the respective genetic markers is obtained by fingerprinting the BAC pools and 
the individual fingerprints with different restriction enzyme combinations in. the AFLP 
fingerprinting for the different genetic markers. A, B and C are genetic markers. KE1 and 
RE2 are different restriction enzyme combinations (for instance HmdlE/Msel and 
EcoRI/Msel). 1-8 are BAC clones. 

20 A genetic marker A has been linked to physical BAC clones 1-4 using enzyme combination 
RE1 in the AFLP fingerprinting of the donoipaTents, the BAC pools and the individual 
BACs according to the method of the invention. Genetic marker O has been mapped to BC 
clones 6, 9 and 10 using the same enzyme combination RE1. Genetic markers X and C 
have similarly been mapped to BAC clones 2,5, 8 and 1,6,7 respectively, using enzyme 

25 combination RE2. Based on the common presence of BAC clones 1 and 2 in A and in B 
and C respectively, the genetic marker A identified with a different enzyme combination 
fiian genetic markers B and C can now be placed on the common integrated map. Based on 
the common presence of BAC clone 6 in O and C respectively, genetic marker T can be 
located adjacent to genetic marker C on the integrated map, thereby providing for an 

30 integrated map of genetic markers T, A, C and O that have been identified using different 
AFLP enzyme combinations. 
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Examples 

5 EAC library construction 

High molecular weight DNA of the nematode Pristionchus pacificus was used to 
construct aBAC library, Partially HintiKL digested DNA of Pristionchus pacificus was 
separated with pulsed field gel electrophoresis and ligated into the pIhdxgoBAC-536 vector. 
The ligated products were elecfroporated into electrocompetent DH10B E, coli cells and 
0 the average insert size was determined on field inversion gel electrophoresis. The 

Pristionchus pacificus BAC library contains 13.500 clones with an all over average insert 
size of >125 kb. The estimated genome coverage of the library is estimated to be at least 7 
assuming a 200 Mb total genome size of Pristionchus pacificus. 

5 The pooling of the BAC library 

The pooling of the BAC library was perfoimed by pooling individual 96-well 
plates. Each 96-well plate was inoculated on a standard TY-agarplate with 
chloroamphenicol selection (12.5 mg/ltr) ? and incubated overnight at 37 Celsius. In the 
morning the 96 colonies were resuspended in 1 ml of liquid TY medium and collected 
0 from the agarplates. A standard miniprep procedure was performed and the pooled BAC 
DNA was used for AFLP template preparation. 

DNA preparation of individual BACs and BAC pools 

DNA from individual BACs was prepared according to standard procedures 
5 (Sambroolc, L 5 Fxitsch, E.F., audManiatis, T~, iaMolecular Cloning: A Laboratory Manual. 
Cold Spring Harbor Laboratory Press, NY> Vol. I, 2, 3 (1989).) An overnight 1.5 ml 
culture in TY medium and chloroamphenicol (12.5 mg/ltr) selection generates sufficient 
DNA for AFLP template preparatioix. 



AFLP analysis of individual BACs and BAC pools 
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AFLP templates of individual BAC DNA, the BAC pools and the two donorpaxents 
that were used in the genetic mapping population were prepared according to standard 
procedures (Vos et al Nucleic Acids Research (1995), 23, 4407-4414 ) using HinSHL and 
Msel as the enzyme combination. 

For the +0/+0 amplification of the individual BAC fragments, no preampliflcation 
step was necessary. The +0/+0 amplification step was directly performed on the 1:10- 
diluted template of the individual BAC DNA, The amplified +0/+0 fragments were 
detected and sized on aMegaBACE 1000 capillary electrophoresis platform. 

AFLP reactions on the BAC pools were preceded by preamplificarions with 

one selective nucleotide on each primer. AFLP reactions were performed with primer 
combinations having 2 selective nucleotides on the HindUE primer and 3 selective 
nucleotides on the Msel primer. 

The amplified +2/H-3 AFLP fragments were detected and sized on the same 
MegaBACE 1000 capillary electrophoresis platform as the +0/+0 fingerprints. 

AFLP adaptor prime rs f5 ? -3',) 

The following primers were used to generate the fingerprints of the BAC clones 
91M35 : iSwdlH-adaptor primer forward (5^3*) : CTCGTAGACTGCGTACC 
91M37 : jffiwdDI-adaptor primer reverse (5'-3>) : AGCTGGTACGC AGTCTAC 
92A18 : Afcsl-adaptor primer forward (5 3 -3 9 ) r GACGATGAGTCCTGAG 
92A19 : A&el-adaptor primer reverse (5>-3') : TACTCAGGACTCAT 

Non selective primer sequences f5'-3 ?> ) 

93Q34 : J3fadHI+0 primer (5 ? -3 7 ) : GACTGCGTACCAGCTT 

93E40 :MseI+0 primer (5'~3') : GATGAGTCCTGAGTAA 

AFLP selective primer sequences (5'-3^ 

Hindni+2 primers can be labeled with FAM, JOE or NED for detection on MegaBACE 
platform. 

Hindm +2 primers (5 3 -3') : 
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Constant sequence: GACTGCGTACCAGCTT 
Selective nucleotides: 



AA 


CA 


GA 


TA 


AC 


CC 


GC 


TC 


AG 


CG 


GG 


TG 


AT 


CT 


GT 


TT 



Msel +3 primers (5'-3') : 

Constant sequence: GATGAGTCCTGAGTAA 
5 Selective nucleotides: 



AAA 


TAA 


CAA 


GAA 


AGA 


TGA 


CGA 


GGA 


AAC 


TAC 


CAC 


GAC 


AGC 


TGC 


CGC 


GGC 


AAT 


TAT 


CAT 


GAT 


AGT 


TGT 


CGT 


GGT 


AAG 


TAG 


CAG 


GAG 


AGG 


TGG 


CGG 


GGG 


ACA 


TCA 


CCA 


GCA 


ATA 


TTA 


CTA 


GTA 


ACC 


TCC 


CCC 


GCC 


ATC 


TTC 


CTC 


GTC 


ACT 


TCT 


CCT 


GCT 


ATT 


TXT 


CTT 


GTT 


ACG 


TCG 


CCG 


GCG 


ATG 


TTG 


CTG 


GTG 



Software analysis 

All MegaB ACE traces were analysed by in-house developed software and the . 
resulting datasets of the ffiwM+0 / MseI+0 individual fingerprints were processed by FPC 

10 software (Soderlund et al vide supra) The HindHL+2 1 MseB-3 AFLP fingerprints of the 
pooled BAC DNAs and the donorparents of the genetic mapping population were also . 
analysed by in-house software. The link of the HinAm+2 I Msel+3 fragment to the 
HindSH+O I Msel+Q fragment in the corresponding individual BAC fingerprint was made in 
silico by searching fragments of comparable length within a certain error margin. The 

1 5 contigs were generated with a cut-off parameter set at various values between 10" and 1 0" 
and the resulting contig evaluated, Tolerance was set at 2-4. ( i.e. 0.2-0.4 basepairs) 



Results and conclusions 
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Genetic markers have been linked to BAC contigs in a very efficient way and 
multiple genetic markers have been combined into an integrated physical and genetic map. 
The concept is applicable to different species with varying genome sizes with success, and 
without prior knowledge of the specific genomic sequence- The method provides for the 
5 effective integration of ph.ysical.and genetic maps and will have an important impact in the 
genomic arena. 
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Claims 

1 . A method for providing an integrated genetic and physical map of a genome 

or part thereof; the method comprising the steps of: 
5 (a) providing at least two individual genetic markers for the genome or 

part thereof, preferably in the form of a genetic map; 

(b) characterising the genetic markers by means of at least one AFLP 
fragment identified through AFLP fingea^printing; 

(c) providing a library of clones comprising fragments of the genome or 
1 0 part thereof preferably an artificial chromosome library such as a 

w * BACorYAC; 

(d) generating a multitude of pools, each pool containing a multitude of 
individual clones from the library, 

(e) generating an AFLP fingerprint for each of the pools; 

15 CO identifying in the multitude of pools a pool in which an AFLP 

fragment identified in (b) is present in the fingerprint of the pool; 
(g) generating an AFLP fingerprint for each of the individual clones in 
. the pool identified in (f), and identifying the clone containing the 
AFLP fragment identified in (b) in its AFLP fingerprint; 
20 (h) generating a contig comprising the individual clone identified in step 

(g); 

(i) repeating steps (f-h) for at least a second AFLP fragment identified 
in (b) whereby the second or further AFLP fragments characterise a 
second or further genetic marker; and, 
25 (j) linking at least two of the contigs obtained in (h) to thereby obtain an 

integrated physical and genetic map of the genome or part thereof, 
which comprises at least two genetic markers; 
wherein the forward and reverse AFLP primers used in step (b) and (e) comprise K 
respectively L selective nucleotides at the 3'-end of the primers, wherein the forward 
3 0 and reverse AFLP primers used in step (g) comprise M respectively N selective 

nucleotides at the 3 '-end of the primers, wherein K, L, M, N are integers from 0 to 
10 3 and wherein K-KL ^M+N, 
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2. A method for linking a genetic marker to a physical marker in a genome or 

part thereof, the method comprising the steps of: 

(a) characterising the genetic marker by m eans of at least one AFLP 
5 fragment identified through AFLP fingerprinting; 

(b) providing a library of clones comprising fragments of the genome or 
part thereof, preferably an artificial chromosome library such as a 
BACorYAC; 

(c) generating a multitude of pools, each pool containing a multitude of 
1 o individual clones from the library; 

(d) generating an AFLP fingerprint for each of the pools; 

(e) identifying in the multitude of pools a pool in which an AFLP 
fragment identified in (a) is present in the fingerprint of the pool; 

(f) generating an AFLP fingerprint for each of the individual clones in 
X5 the pool identified in (e), and identifying the clone containing the 

AFLP fragment identified in (a) in its AFLP fingerprint; 

(g) generating a contig comprising the individual clone identified in step 
(f), thereby linking the genetic marker to a physical marker; 

wherein the forward and reverse AFLP primers used in step (a) and (d) comprise K 
20 respectively L selective nucleotides at the 3 '-end of the primer, wherein the forward and 
reverse AFLP primers used in step (f) comprise M respectively N selective nucleotides at 
the 3 '-end of the primer, wherein K, L, M 3 N are integers from 0 to 10, and wherein K+L > 
M+N, 

25 3, Method according to claim 2, wherein steps (a)-(g) are repeated for further 

genetic markers in the genome or part thereof and wherein the contigs obtained in 
(g) are aligned to thereby obtain an integrated physical and genetic map. 

4. Method according claims 1-3, wherein (K+L) - (M+N) is at least 1, 

30 preferably at least 2, more preferably at least 3, most preferably at least 4. 



5. Method according claims 1-4, wherein M+N is at least 0, preferably at least 

1, more preferably at least 2, most preferably at least 3 . 
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6. Method according claims 1-5, wherein, each pool contains at most 0.6 
genome equivalent of the total genome to be analysed, preferably 0,5 more 
preferably 0.4, most preferably 0.3, 

5 

7. Method according to claims 1-6, further comprising an additional pooling 
step . 

8, Method according to claims 1-7, wherein the genetic markers are provided 
1 0 with a density of at least one genetic marker per 100 kb. 

9, Method according to claims 1-8, wherein the contigs are aligned using a 
computer program suitable for aligning such as FPC 

15 10. Method according to claims 1-9, wherein the artificial chromosome library 

contains at least 5 genome equivalents. 

11. Use of AFLP primers for the integration of genetic and physical maps. 



20 12. Use of AFLP in linking genetic and physical genome maps. 

13 . Use of AFLP primers in linking genetic and physical genome maps. 

14. Use of a first and a secondpair of AFLP primers in a method for linking 

25 genetic and physical genome maps wherein the first pair of AFLP primers comprise 

K respectively L selective nucleotides at the 3 '-end of the primer, wherein the 
second pair of AFLP primers used comprise M respectively N selective nucleotides 
at the 3 '-end of the primer^ wherein K, L, M, N are integers from 0 to 10, and 
wherein K+L ^M+N. - 



30 
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